Title: NNFWG2

RESULT 7
AAW48713
ID AAW48713 standard; Protein; 2233 AA.
XX
AC AAW48713;
XX
DT 13-OCT-1998 (first entry)
XX
DE HPIV-3 Vero cp45 vaccine L protein.
XX
KW L protein; attenuation; non-segmented; negative sense; vaccine; immunity;
KW single stranded RNA virus; Mononegavirales.
XX
OS Human parainfluenza virus.
XX
PN WO9813501-A2.
XX
PD 02-APR-1998.
XX
PF 19-SEP-1997; 97WO-US16718.
XX
PR 27-SEP-1996; 96US-0026823.
XX
PA (AMCY ) AMERICAN CYANAMID CO.
PA (USSH ) US DEPT HEALTH & HUMAN SERVICES.
XX
PI Murphy BR, Randolph VB, Sidhu MS, Tatem JM, Udem SA;
XX
DR WPI; 1998-230710/20.
DR N-PSDB; AAV18274.
XX
PT Recombinantly-generated, attenuated, non-segmented, negative-sense,
PT single stranded RNA virus of order Mononegavirales - having
PT attenuating mutation in 3' genomic promoter region and RNA
PT polymerase gene, useful as vaccine to immunise against such virus
XX
PS Disclosure; Page 283-291; 426pp; English.
XX
CC This sequence represents the Human parainfluenza virus (HPIV-3) type 3
CC vaccine Vero cp45 L protein. This sequence is used in a method which
CC involves the isolation of recombinantly-generated, attenuated,
CC non-segmented, negative-sense, single stranded RNA virus of the order
CC Mononegavirales which have at least 1 attenuating mutation in the 3'
CC genomic promoter region and at least 1 attenuating mutation in the RNA
CC polymerase gene. This RNA virus can be used as a vaccine to immunise an
CC individual against such a virus.
XX
SQ Sequence 2233 AA;

Query Match 100.0%; Score 35; DB 19; Length 2233;
Best Local Similarity 100.0%; Pred. No. 1.3e+03;
Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNFWG 5
        |||||
Db 1726 NNFWG 1730



Title: NNIWG4

RESULT 4
GUNS_ERWCA
ID GUNS_ERWCA STANDARD; PRT; 264 AA.
AC P16630;
DT 01-AUG-1990 (Rel. 15, Created)
DT 01-AUG-1990 (Rel. 15, Last sequence update)
DT 28-FEB-2003 (Rel. 41, Last annotation update)
DE Endoglucanase S precursor (EC 3.2.1.4) (Endo-1,4-beta-glucanase S)
DE (Cellulase S).
GN CELS.
OS Erwinia carotovora.
OC Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales;
OC Enterobacteriaceae; Pectobacterium.
OX NCBI_TaxID=554;
RN [1]
RP SEQUENCE FROM N.A., AND PARTIAL SEQUENCE.
RC STRAIN=SCC3193;
RX MEDLINE=90337352; PubMed=2379*37;
RA Saarilahti H.T., Henrissat B., Palva E.T.;
RT "CelS: a novel endoglucanase identified from Erwinia carotovora
RT subsp. carotovora.";
RL Gene 90:9-14(1990).
CC -!- CATALYTIC ACTIVITY: Endohydrolysis of 1,4-beta-D-glucosidic
CC linkages in cellulose, lichenin and cereal beta-D-glucans.
CC -!- SIMILARITY: BELONGS TO CELLULASE FAMILY H (FAMILY 12 OF GLYCOSYL
CC HYDROLASES).
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch)
CC
DR EMBL; M32399; AAA24817.1;
DR PIR; JU0328; JU0328.
DR InterPro; IPR002594; Glyco_hydro_12.
DR Pfam; PF01670; Glyco_hydro_12;
DR ProDom; PD004316; Glyco_hydro_12;
KW Cellulose degradation; Hydrolase; Glycosidase; Signal.
FT SIGNAL 1 32
FT CHAIN 33 264 ENDOGLUCANASE S.
SQ SEQUENCE 264 AA; /2^757 MW; E6D61388950C77AA CRC64;

Query Match 97.0%; Score 32; DB 1;
Best Local Similarity 80.0%; Pred. No. 28;
Matches 4; Conservative 1; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNIWG 5
        | |
Db   53 HNVWG 57



Title: NNLWG1



RESULT 1 
US-08-774-065-6 

Sequence 6, Application US/08774065 
Patent No. 5989899 
GENERAL INFORMATION: 

APPLICANT: Bower, Benjamin 
APPLICANT: Clarkson, Kathleen 
APPLICANT: Larenas, Edmund 
APPLICANT: Ward, Michael 

TITLE OF INVENTION: NOVEL OVERSIZED CELLULASE COMPOSITIONS 
TITLE OF INVENTION: FOR USE IN DETERGENT COMPOSITIONS AND 
TITLE OF INVENTION: IN THE TREATMENT OF TEXTILES 
NUMBER OF SEQUENCES: 16 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: GENENCOR INTERNATIONAL 
STREET: 925 PAGE MILL ROAD 
CITY: PALO ALTO 
STATE: CALIFORNIA 
COUNTRY: UNITED STATES 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/774,065 
FILING DATE: 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY/AGENT INFORMATION:
NAME: Glaister, Debra J.
REGISTRATION NUMBER: 33,888
REFERENCE/DOCKET NUMBER: GC368
TELECOMMUNICATION INFORMATION:
TELEPHONE: 415-846-7620
TELEFAX: 415-845-6504
INFORMATION FOR SEQ ID NO: 6:
SEQUENCE CHARACTERISTICS:
LENGTH: 77 amino acids
TYPE: amino acid
STRANDEDNESS: single
TOPOLOGY: linear
US-08-774-065-6 

Query Match 100.0%; Score 33; DB 2; Length 77; 

Best Local Similarity 100.0%; Pred. No. 24; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   68 NNLWG 72



RESULT 2 
US-08-774-065-2 

Sequence 2, Application US/08774065 
Patent No. 5989899 
GENERAL INFORMATION: 

APPLICANT: Bower, Benjamin 
APPLICANT: Clarkson, Kathleen 
APPLICANT: Larenas, Edmund 
APPLICANT: Ward, Michael 

TITLE OF INVENTION: NOVEL OVERSIZED CELLULASE COMPOSITIONS 
TITLE OF INVENTION: FOR USE IN DETERGENT COMPOSITIONS AND 
TITLE OF INVENTION: IN THE TREATMENT OF TEXTILES 
NUMBER OF SEQUENCES: 16 
CORRESPONDENCE ADDRESS: 



ADDRESSEE: GENENCOR INTERNATIONAL
STREET: 925 PAGE MILL ROAD
CITY: PALO ALTO
STATE: CALIFORNIA
COUNTRY: UNITED STATES 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/774,065 

FILING DATE: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 

NAME: Glaister, Debra J. 

REGISTRATION NUMBER: 33,888 

REFERENCE/DOCKET NUMBER: GC368 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-846-7620 

TELEFAX: 415-845-6504 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 136 amino acids 

TYPE: amino acid 

STRANDEDNESS: single

TOPOLOGY: linear 
US-08-774-065-2 



Query Match 100.0%; Score 33; DB 2; Length 136; 

Best Local Similarity 100.0%; Pred. No. 42; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy    1 NNLWG 5
        |||||
Db   35 NNLWG 39



RESULT 3 

US-09-216-295-11 

Sequence 11, Application US/09216295 
Patent No. 6268328 
GENERAL INFORMATION: 
APPLICANT: Mitchinson, Colin 
APPLICANT: Wendt, Dan J. 

TITLE OF INVENTION: Novel Variant EGIII-Like Cellulase Compositions
FILE REFERENCE: GC555
CURRENT APPLICATION NUMBER: US/09/216,295
CURRENT FILING DATE: 1998-12-18
NUMBER OF SEQ ID NOS: 41

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 11 
LENGTH: 194 
TYPE: PRT 

ORGANISM: Chaetomium brasiliense 
US-09-216-295-11 



Query Match 100.0%; Score 33; DB 3; Length 194;

Best Local Similarity 100.0%; Pred. No. 60;

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   45 NNLWG 49



RESULT 4 

US-08-032-848C-10 



Sequence 10, Application US/08032848C 
Patent No. 5475101 
GENERAL INFORMATION: 



APPLICANT: Ward, Michael
APPLICANT: Clarkson, Kathleen A.
APPLICANT: Weiss, Geoffrey L.
APPLICANT: Larenas, Edward
APPLICANT: Lorch, Jeffrey D.
TITLE OF INVENTION: Purification and Molecular Cloning of 
TITLE OF INVENTION: EG III Cellulase 
NUMBER OF SEQUENCES: 20 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Genencor International 
STREET: 180 Kimball Way 
CITY: South San Francisco 
STATE: CA 
COUNTRY: USA
ZIP: 94080 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/032,848C
FILING DATE: MAR 17 1993 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: Horn, Margaret A. 
REGISTRATION NUMBER: 33,401 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415 742-7356 
TELEFAX: 415 742-7217 
INFORMATION FOR SEQ ID NO: 10: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 218 amino acids 
TYPE: amino acid 
STRANDEDNESS: single
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-032-848C-10 

Query Match 100.0%; Score 33; DB 1; Length 218; 

Best Local Similarity 100.0%; Pred. No. 67; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   19 NNLWG 23



RESULT 5 

US-08-438-870-10 

Sequence 10, Application US/08438870 
Patent No. 5753484 
GENERAL INFORMATION: 

APPLICANT: Ward, Michael 
APPLICANT: Clarkson, Kathleen A. 
APPLICANT: Weiss, Geoffrey L. 
APPLICANT: Larenas, Edward 
APPLICANT: Lorch, Jeffrey D. 

TITLE OF INVENTION: Purification and Molecular Cloning of EG
TITLE OF INVENTION: III Cellulase 
NUMBER OF SEQUENCES: 11 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Genencor International 
STREET: 180 Kimball Way 
CITY: South San Francisco 
STATE: CA 
COUNTRY: USA 
ZIP: 94080 



COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS

SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/438,870

FILING DATE: May 10, 1995 

CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 

NAME: Christopher L. Stone 

REGISTRATION NUMBER: 35,696 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415 742-7555 

TELEFAX: 415 742-7217 
INFORMATION FOR SEQ ID NO: 10:
SEQUENCE CHARACTERISTICS: 

LENGTH: 218 amino acids 

TYPE: amino acid 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-438-870-10 



Query Match 100.0%; Score 33; DB 1; Length 218; 

Best Local Similarity 100.0%; Pred. No. 67; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   19 NNLWG 23



RESULT 6 

US-08-169-948B-34 

Sequence 34, Application US/08169948B 
Patent No. 5861271 
GENERAL INFORMATION: 

APPLICANT: Fowler, Timothy 
APPLICANT: Ward, Michael 
APPLICANT: Clarkson, Kathleen 
APPLICANT: Collier, Katherine 
APPLICANT: Larenas, Edmund 

TITLE OF INVENTION: Novel Cellulase Enzymes and Systems
TITLE OF INVENTION: For Their Expression
NUMBER OF SEQUENCES: 48
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Genencor International 
STREET: 180 Kimball Way 
CITY: South San Francisco 
STATE: CA 
COUNTRY: USA 
ZIP: 94080 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/169,948B
FILING DATE: DEC 17 1993
CLASSIFICATION: 435
ATTORNEY/AGENT INFORMATION:
NAME: Horn, Margaret A.
REGISTRATION NUMBER: 33,401
REFERENCE/DOCKET NUMBER: GC226
TELECOMMUNICATION INFORMATION:
TELEPHONE: (415) 742-7536
TELEFAX: (415) 742-7217
INFORMATION FOR SEQ ID NO: 34: 
SEQUENCE CHARACTERISTICS: 



LENGTH: 218 amino acids 
TYPE: amino acid 
STRANDEDNESS: single
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-169-948B-34 

Query Match 100.0%; Score 33; DB 2; Length 218; 

Best Local Similarity 100.0%; Pred. No. 67; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   19 NNLWG 23



RESULT 7 

US-08-448-873-34 

Sequence 34, Application US/08448873 
Patent No. 5874276 
GENERAL INFORMATION: 

APPLICANT: Fowler, Timothy 
APPLICANT: Ward, Michael 
APPLICANT: Clarkson, Kathleen 
APPLICANT: Collier, Katherine A. 
APPLICANT: Larenas, Edmund 

TITLE OF INVENTION: Novel Cellulase Enzymes and Systems
TITLE OF INVENTION: For Their Expressions
NUMBER OF SEQUENCES: 48
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Genencor International 
STREET: 180 Kimball Way 
CITY: South San Francisco 
STATE: CA
COUNTRY: USA
ZIP: 94080
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/448,873 
FILING DATE: 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/169,948 
FILING DATE: 17-DEC-1993 
ATTORNEY/AGENT INFORMATION: 
NAME: Stone, Christopher L. 
REGISTRATION NUMBER: 35,696 
REFERENCE/DOCKET NUMBER: GC226D14 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 742-7555 
TELEFAX: (415) 742-7217 
INFORMATION FOR SEQ ID NO: 34: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 218 amino acids 
TYPE: amino acid 
STRANDEDNESS: single
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-448-873-34 

Query Match 100.0%; Score 33; DB 2; Length 218; 

Best Local Similarity 100.0%; Pred. No. 67; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   19 NNLWG 23



RESULT 8 

US-08-382-452D-34 

Sequence 34, Application US/08382452D 
Patent No. 6268196 
GENERAL INFORMATION: 



APPLICANT: Fowler, Timothy
APPLICANT: Clarkson, Kathleen A.
APPLICANT: Ward, Michael
APPLICANT: Collier, Katherine D.
APPLICANT: Larenas, Edmund A.
TITLE OF INVENTION: NOVEL CELLULOSE ENZYMES AND SYSTEMS
TITLE OF INVENTION: FOR THEIR EXPRESSION
NUMBER OF SEQUENCES: 43
CORRESPONDENCE ADDRESS:

ADDRESSEE: Genencor International 
STREET: 925 Page Mill Road 
CITY: Palo Alto 
STATE: CA
COUNTRY: USA
ZIP: 94080
COMPUTER READABLE FORM: 
MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/382,452D
FILING DATE: February 1, 1995 
ATTORNEY/AGENT INFORMATION: 
NAME: Christopher L. Stone 
REGISTRATION NUMBER: 36,696 
REFERENCE/DOCKET NUMBER: GC226-2 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 742-7555 
TELEFAX: (415) 742-7217 
INFORMATION FOR SEQ ID NO: 34: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 218 amino acids 
TYPE: amino acid 
STRANDEDNESS: single
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-382-452D-34 



Query Match 100.0%; Score 33; DB 3; Length 218;

Best Local Similarity 100.0%; Pred. No. 67;

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   19 NNLWG 23



RESULT 9 
US-09-216-295-1 

Sequence 1, Application US/09216295 
Patent No. 6268328 
GENERAL INFORMATION: 
APPLICANT: Mitchinson, Colin 
APPLICANT: Wendt, Dan J. 

TITLE OF INVENTION: Novel Variant EGIII-Like Cellulase Compositions
FILE REFERENCE: GC555
CURRENT APPLICATION NUMBER: US/09/216,295
CURRENT FILING DATE: 1998-12-18
NUMBER OF SEQ ID NOS: 41

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 1 
LENGTH: 218 
TYPE: PRT 

ORGANISM: Trichoderma longibrachiatum 



US-09-216-295-1 



Query Match 100.0%; Score 33; DB 3; Length 218; 

Best Local Similarity 100.0%; Pred. No. 67; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 NNLWG 5
        |||||
Db   19 NNLWG 23



RESULT 10 
US-08-507-362A-18 

Sequence 18, Application US/08507362A 
Patent No. 6562340 
GENERAL INFORMATION: 

APPLICANT: Bedford, Michael 
Morgan, Andrew 
Fowler, Timothy 
Ward, Michael 
Clarkson, Kathleen 
Collier, Katherine 
Larenas, Edmund 

TITLE OF INVENTION: An Enzyme Feed Additive and Animal Feed Including It 
NUMBER OF SEQUENCES: 21 g 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Genencor International 
STREET: 925 Page Mill Road 
CITY: Palo Alto 
STATE: CA 
COUNTRY: USA 
ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/507,362A
FILING DATE: 27-Oct-1995 
CLASSIFICATION: <Unknown> 
ATTORNEY/AGENT INFORMATION: 
NAME: Castaneda, Janet 
REGISTRATION NUMBER: 33,228 
REFERENCE/DOCKET NUMBER: GC226-3
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (650) 846-4072 
TELEFAX: (650) 845-6504 
INFORMATION FOR SEQ ID NO: 18: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 218 amino acids 
TYPE: amino acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 18:
US-08-507-362A-18 

Query Match 100.0%; Score 33; DB 4; Length 218; 

Best Local Similarity 100.0%; Pred. No. 67; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 NNLWG 5
        |||||
Db   19 NNLWG 23



RESULT 11 
US-08-032-848C-13 

Sequence 13, Application US/08032848C
Patent No. 5475101



GENERAL INFORMATION: 



APPLICANT: Ward, Michael
APPLICANT: Clarkson, Kathleen A.
APPLICANT: Weiss, Geoffrey L.
APPLICANT: Larenas, Edward
APPLICANT: Lorch, Jeffrey D.
TITLE OF INVENTION: Purification and Molecular Cloning of 
TITLE OF INVENTION: EG III Cellulase 
NUMBER OF SEQUENCES: 20 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Genencor International 
STREET: 180 Kimball Way 
CITY: South San Francisco 
STATE: CA 
COUNTRY: USA 
ZIP: 94080
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/032,848C
FILING DATE: MAR 17 1993 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: Horn, Margaret A. 
REGISTRATION NUMBER: 33,401 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415 742-7356 
TELEFAX: 415 742-7217 
INFORMATION FOR SEQ ID NO: 13: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 221 amino acids 
TYPE: amino acid 
STRANDEDNESS: single
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-032-848C-13 

Query Match 100.0%; Score 33; DB 1; Length 221; 

Best Local Similarity 100.0%; Pred. No. 68; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 NNLWG 5
        |||||
Db   21 NNLWG 25



RESULT 12 
US-09-146-770-1 

Sequence 1, Application US/09146770
Patent No. 6187732
GENERAL INFORMATION:
APPLICANT: Fowler, Timothy
TITLE OF INVENTION: Mutant EGIII Cellulase, DNA Encoding
TITLE OF INVENTION: Such EGIII Compositions and Methods for Obtaining Same
FILE REFERENCE: GC546
CURRENT APPLICATION NUMBER: US/09/146,770
CURRENT FILING DATE: 1998-09-03
NUMBER OF SEQ ID NOS: 4
SOFTWARE: FastSEQ for Windows Version 3.0
SEQ ID NO 1
LENGTH: 232
TYPE: PRT
ORGANISM: T. reesei
US-09-146-770-1 

Query Match 100.0%; Score 33; DB 3; Length 232; 

Best Local Similarity 100.0%; Pred. No. 71; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   35 NNLWG 39



RESULT 13 
US-09-633-084-1 

Sequence 1, Application US/09633084 
Patent No. 6407046 
GENERAL INFORMATION: 
APPLICANT: Fowler, Timothy 

TITLE OF INVENTION: Mutant EGIII Cellulase, DNA Encoding 

TITLE OF INVENTION: Such EGIII Compositions and Methods for Obtaining Same 
FILE REFERENCE: GC546

CURRENT APPLICATION NUMBER: US/09/633,084 
CURRENT FILING DATE: 2000-08-04 
PRIOR APPLICATION NUMBER: 09/146,770 
PRIOR FILING DATE: 1998-09-03 
NUMBER OF SEQ ID NOS: 4

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 1 
LENGTH: 232 
TYPE: PRT 

ORGANISM: T. reesei 
US-09-633-084-1 

Query Match 100.0%; Score 33; DB 4; Length 232; 

Best Local Similarity 100.0%; Pred. No. 71; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 NNLWG 5
        |||||
Db   35 NNLWG 39

RESULT 3 
S12610 

cellulase (EC 3.2.1.4) precursor - Aspergillus aculeatus 
N;Alternate names: endo-1,4-beta-glucanase
C;Species: Aspergillus aculeatus 

C;Date: 07-Apr-1994 #sequence_revision 07-Apr-1994 #text change 21-Jul-2000 
C;Accession: S12610; S14118; S40186; JQ0458 

R;Ooi, T.; Shinmyo, A.; Okada, H.; Murao, S.; Kawaguchi, T.; Arai, M.
Nucleic Acids Res. 18, 5884, 1990

A;Title: Complete nucleotide sequence of a gene coding for Aspergillus aculeatus cellulase
(FI-CMCase).

A;Reference number: S12610; MUID:91016934; PMID:2216782
A;Accession: S12610
A;Molecule type: DNA
A;Residues: 1-237 <OOI1>

A;Cross-references: EMBL:D00546; NID:g217818; PIDN:BAA00435.1; PID:g217819
R;Ooi, T.; Shinmyo, A.; Okada, H.; Hara, S.; Ikenaka, T.; Murao, S.; Arai, M.
Curr. Genet. 18, 217-222, 1990

A;Title: Cloning and sequence analysis of a cDNA for cellulase (FI-CMCase) from Aspergillus
aculeatus.

A;Reference number: S14118; MUID:91064758; PMID:2249253
A;Accession: S14118
A;Molecule type: mRNA
A;Residues: 1-237 <OOI2>

A;Cross-references: EMBL:X52525; NID:g2287; PIDN:CAA36757.1; PID:g2288
A;Accession: S40186
A;Molecule type: protein

A;Residues: 17-18;42-49,'X',51-54,'X';66-79;90-111;136-205,'XX',208-211 <OOI3>
C;Genetics:

A;Introns: 138/2; 212/1
C;Function:

A;Description: endohydrolysis of 1,4-beta-D-glucosidic linkages in beta-D-glucans such as cellulose
A;Note: will also hydrolyze such linkages in beta-D-glucans that also contain 1,3-linkages
A;Pathway: cellulose degradation

C;Keywords: glycosidase; hydrolase; polysaccharide degradation; pyroglutamic acid
F;1-16/Domain: signal sequence #status predicted <SIG>
F;17-237/Product: cellulase #status experimental <MAT>

F;17/Modified site: pyrrolidone carboxylic acid (Gln) (in mature form) #status experimental



Query Match 100.0%; Score 33; DB 2; Length 237; 

Best Local Similarity 100.0%; Pred. No. 46; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   37 NNLWG 41



RESULT 1 
GUN_ASPAC 

ID GUN_ASPAC STANDARD; PRT; 237 AA.
AC P22669;
DT 01-AUG-1991 (Rel. 19, Created)
DT 01-AUG-1991 (Rel. 19, Last sequence update)
DT 15-SEP-2003 (Rel. 42, Last annotation update)
DE Endoglucanase I precursor (EC 3.2.1.4) (Endo-1,4-beta-glucanase)
DE (Cellulase) (FI-CMCASE).
OS Aspergillus aculeatus.
OC Eukaryota; Fungi; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiales; Trichocomaceae; mitosporic Trichocomaceae; Aspergillus.
OX NCBI_TaxID=5053;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=F-50;
RX MEDLINE=91016934; PubMed=2216782;
RA Ooi T., Shinmyo A., Okada H., Murao S., Kawaguchi T., Arai M.;
RT "Complete nucleotide sequence of a gene coding for Aspergillus
RT aculeatus cellulase (FI-CMCase).";
RL Nucleic Acids Res. 18:5884-5884(1990).
RN [2]
RP SEQUENCE FROM N.A., AND PARTIAL SEQUENCE.
RC STRAIN=F-50;
RX MEDLINE=91064758; PubMed=2249253;
RA Ooi T., Shinmyo A., Okada H., Hara S., Ikenaka T., Murao S.,
RA Arai M.;
RT "Cloning and sequence analysis of a cDNA for cellulase (FI-CMCase)
RT from Aspergillus aculeatus.";
RL Curr. Genet. 18:217-222(1990).
CC -!- CATALYTIC ACTIVITY: Endohydrolysis of 1,4-beta-D-glucosidic
CC linkages in cellulose, lichenin and cereal beta-D-glucans.
CC -!- SUBCELLULAR LOCATION: Secreted.
CC -!- INDUCTION: By cellulosic materials and hemicelluloses.
CC -!- MISCELLANEOUS: Will also hydrolyze 1,4-linkages in beta-D-glucans
CC also containing 1,3-linkages.
CC -!- SIMILARITY: BELONGS TO CELLULASE FAMILY H (FAMILY 12 OF GLYCOSYL
CC HYDROLASES).
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch)
CC
DR EMBL; D00546; BAA00435.1; -.
DR EMBL; X52525; CAA36757.1; -.
DR PIR; S12610; S12610.
DR InterPro; IPR002594; Glyco_hydro_12.
DR Pfam; PF01670; Glyco_hydro_12; 1.
DR ProDom; PD004316; Glyco_hydro_12; 1.
KW Cellulose degradation; Hydrolase; Glycosidase; Signal;
KW Pyrrolidone carboxylic acid.
FT SIGNAL 1 16 POTENTIAL.
FT CHAIN 17 237 ENDOGLUCANASE I.
FT MOD_RES 17 17 PYRROLIDONE CARBOXYLIC ACID
SQ SEQUENCE 237 AA; 25560 MW; 8F173571A8AE6931 CRC64;



Query Match 100.0%; Score 33; DB 1; Length 237; 

Best Local Similarity 100.0%; Pred. No. 18; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   37 NNLWG 41



RESULT 6 
O00095
ID O00095 PRELIMINARY; PRT; 234 AA.
AC O00095;
DT 01-JUL-1997 (TrEMBLrel. 04, Created)
DT 01-JUL-1997 (TrEMBLrel. 04, Last sequence update)
DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update)
DE Endo-beta-1,4-glucanase (EC 3.2.1.4).
GN EGL.
OS Trichoderma reesei (Hypocrea jecorina).
OC Eukaryota; Fungi; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreales; Hypocreaceae; Hypocrea.
OX NCBI_TaxID=51453;
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=QM9414;
RA Okada H., Tada K., Sekiya T., Yokoyama K., Takahashi A., Tohda H.,
RA Kumagai H., Morikawa Y.;
RT "Molecular characterization and heterologous expression of the gene
RT encoding a low-molecular-mass endoglucanase from Trichoderma reesei
RT QM9414.";
RL Appl. Environ. Microbiol. 64:555-563(1998).
DR EMBL; AB003694; BAA20140.1; -.
DR InterPro; IPR002594; Glyco_hydro_12.
DR Pfam; PF01670; Glyco_hydro_12; 1.
DR ProDom; PD004316; Glyco_hydro_12; 1.
KW Glycosidase; Hydrolase.
SQ SEQUENCE 234 AA; 25159 MW; DF476EEDE384ADD1 CRC64;

Query Match 100.0%; Score 33; DB 3; Length 234; 

Best Local Similarity 100.0%; Pred. No. 1.3e+02; 

Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   35 NNLWG 39



RESULT 7
Q8NJY5
ID Q8NJY5 PRELIMINARY; PRT; 234 AA.
AC Q8NJY5;
DT 01-OCT-2002 (TrEMBLrel. 22, Created)
DT 01-OCT-2002 (TrEMBLrel. Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. Last annotation update)
DE Endoglucanase.
GN CEL12A.
OS Hypocrea koningii.
OC Eukaryota; Fungi; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreales; Hypocreaceae; Hypocrea.
OX NCBI_TaxID=97093;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=22067395; PubMed=12073090;
RA Goedegebuur F., Fowler T., Phillips J., van der Kley P.,
RA van Solingen P., Dankmeyer L., Power S.D.;
RT "Cloning and relational analysis of 15 novel fungal endoglucanases
RT from family 12 glycosyl hydrolase.";
RL Curr. Genet. 41:89-98(2002).
DR EMBL; AF435069; AAM77712.1;
DR InterPro; IPR002594; Glyco_hydro_12.
DR Pfam; PF01670; Glyco_hydro_12; 1.
DR ProDom; PD004316; Glyco_hydro_12; 1.
SQ SEQUENCE 234 AA; 25299 MW; 4A£E#486C29FFC44 CRC64;

Query Match 100.0%; Score 33; DB 3; Length 234;
Best Local Similarity 100.0%; Pred. No. 1.3e+02;
Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   35 NNLWG 39



RESULT 8
Q8NJY2
ID Q8NJY2 PRELIMINARY; PRT; 237 AA.
AC Q8NJY2;
DT 01-OCT-2002 (TrEMBLrel. Created)
DT 01-OCT-2002 (TrEMBLrel. Last sequence update)
DT 01-MAR-2003 (TrEMBLrel. Last annotation update)
DE Endoglucanase.
GN CEL12B.
OS Aspergillus awamori (var. kawachi).
OC Eukaryota; Fungi; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiales; Trichocomaceae; mitosporic Trichocomaceae; Aspergillus.
OX NCBI_TaxID=40384;
RN [1]
RP SEQUENCE FROM N.A.
RX MEDLINE=22067395; PubMed=12073090;
RA Goedegebuur F., Fowler T., Phillips J., van der Kley P.,
RA van Solingen P., Dankmeyer L., Power S.D.;
RT "Cloning and relational analysis of 15 novel fungal endoglucanases
RT from family 12 glycosyl hydrolase.";
RL Curr. Genet. 41:89-98(2002).
DR EMBL; AF435072; AAM77715.1;
DR InterPro; IPR002594; Glyco_hydro_12.
DR Pfam; PF01670; Glyco_hydro_12; 1.
DR ProDom; PD004316; Glyco_hydro_12; 1.
SQ SEQUENCE 237 AA; 25710 MW; 4DBDC8563E7CD 11 CRC64;

Query Match 100.0%; Score 33; DB 3; Length 237;
Best Local Similarity 100.0%; Pred. No. 1.4e+02;
Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   37 NNLWG 41



RESULT
O13454
ID O13454 PRELIMINARY; PRT; 239 AA.
AC O13454;
DT 01-JAN-1998 (TrEMBLrel. 05, Created)
DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update)
DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update)
DE Endo-1,4-beta-glucanase (EC 3.2.1.4).
GN CELA.
OS Aspergillus oryzae.
OC Eukaryota; Fungi; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiales; Trichocomaceae; mitosporic Trichocomaceae; Aspergillus.
OX NCBI_TaxID=[illegible];
RN [1]
RP SEQUENCE FROM N.A.
RC STRAIN=KBN616;
RX MEDLINE=97161783; PubMed=9008887;
RA Kitamoto N., Go M., Shibayama T., Kimura T., Kito Y., Ohmiya K.,
RA Tsukagoshi N.;
RT "Molecular cloning, purification and characterization of two endo-1,4-
RT beta-glucanases from Aspergillus oryzae KBN616.";
RL Appl. Microbiol. Biotechnol. 46:538-544(1996).
DR EMBL; D83731; BAA22588.1; -.
DR InterPro; IPR002594; Glyco_hydro_12.
DR Pfam; PF01670; Glyco_hydro_12; 1.
DR ProDom; PD004316; Glyco_hydro_12; 1.
KW Glycosidase; Hydrolase.
SQ SEQUENCE 239 AA; 26096 MW; C0F850E5DFEB455D CRC64;

Query Match 100.0%; Score 33; DB 3; Length 239; 

Best Local Similarity 100.0%; Pred. No. 1.4e+02; 
Matches 5; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 NNLWG 5
        |||||
Db   35 NNLWG 39



